603800 


Approved  for  OTS  release 

^  cf^),  SO 


DDC 


ODC-IRA  C 


-7^ 


md 


^<fr/k<»7aUaH 


1700  MAIN  ST  •  SANTA  MONICA  •  CAUTOINIA 


RAOP  13,  R*v.  1-3-51 


SOME  BOUNDED  SIGNIFICANCE  LEVEL  PROPERTIES  0?  THE  EQUAL-TAIL  SIGN  TEST 

By  John  E.  Walsh 
The  RAND  Corporation 


1.  SjamaQ^.  In  addition  to  being  easily  applied  and  reasonably  efficient  for  tho 
ease  of  snail  samples  from  normal  populations,  the  equal-tail  sign  test  nr  the  popula¬ 
tion  median  is  valid  under  very  general  conditions.  Rarely,  if  ever,  howwer,  are  these 
conditions  exactly  satisfied  in  practice.  Thus  use  of  the  equal-tail  signVtest  for  the 
median  (under  the  assumption  that  the  conditions  on  which  it  is  based  hold)Yis  merely  an 
approximation  to  the  true  practical  situation.  Undoubtedly  the  equal-tail 
used  in  many  cases  where  these  conditions  are  only  roughly  satisfied.  This 

question  of  the  form  and  properties  of  the  test  which  is  approximated  by  _ 

sign  test  when  the  conditions  for  the  equal-tail  sign  test  are  not  satis fl^W^The 
purpose  of  this  paper  4ir  to  investigate  under  what  generalized  conditions  the  equal-tail 
sign  test  for  the  .Tiedian  is  a  satisfactory  approximation  to  the  corresponding  generalised 
test  in  the  sense  that  the  significance  levels  are  nearly  the  same.  It  *^fr^und  that 
the  significance  levels  of  the  two  tests  are  approximately  equal  for  a  wide  variety  of 
situations  if  the  number  of  observations  is  not  large.  Thus,  Tran  the  viewpoint  of 
significance  level,  the  equal-tail  sign  test  for  tlie  median  is  an  idealization  which 
represents  a  sufficiently  accurate  approximation  to  the  test  of  interest  for  many  practi¬ 
cal  cases.  This  significance  level  stability,  combined  with  its  other  favorable  proper¬ 
ties,  sug-’ests  that  the  equal-tail  sign  test  be  seriously  considered  for  application 
whenever  tlie  population  median  is  to  be  tested  on  the  basis  of  a  small  number  of 
observ.ations.  Corresponding  confidence  interval  prope:  i  les  are  also  derived. 

2.  Introduction  and  Discussion.  Let  us  consider  testang  whether  the 
median  ^  equals  a  given  hypothetical  value  for  situations  where  alternative  values  of 
the  median  greater  than  are  to  receive  the  same  emphasis  as  those  less  thanfthis 
value.  The  equal-tail  sign  test  represents  a  solution  to  this  problem  %diich  IM  at  grant 
practical  utility.  The  computation  required  for  the  application  of  an  equal-^il  sign 
test  is  small.  The  efficiency  of  these  tests  is  reasonably  hirh  for  small  sjlples  fton 
normal  populations  (see  [l]).  Also  the  equal-tall  sign  test  is  valid  under  very  general 
conditions.  These  conditions  only  require  that  the  observations  used  for  the  test  are 
statistically  independent  and  from  populations  which  satisfy 

(i).  The  populations  hiave  a  common  median  value  p. 

(ii).  No  population  has  a  discrete  amount  of  probability  concentrated  at 
Pq.  i.e.,  Pr(x=*p^)  =  0  for  each  population. 

Here  it  should  be  emphasi'/od  that  p  is  not  necessarily  unique;  there  mj^y  be  an  entire 
interval  of  points  which  satisfy  (i). 
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Situations  where  is  not  unique  but  represents  a  set  of  points  cause  little  diffi¬ 
culty  if  suitably  interpreted.  An  equal-tail  sign  test  of  the  null  hypothesis  p  ■  is 
merely  a  method  of  deciding  whether  p^  is  a  point  havin’  the  property  that  Pr(x  <  p^)  ■ 
Pr(x  >  p^)  ■  ^  for  each  population.  The  location  of  p^  among  the  ^0%  points  of  a 
population  is  usually  not  of  inportance.  Thus  the  null  hypothesis  P  ■  Pq  bas  the  inter¬ 
pretation  Pq  €  p. 

Let  the  n  independent  observations  on  which  a  test  is  based  be  denoted  by  y2*****^n* 
Subtract  p^  from  each  of  these  observations.  Then  n  non-zero  numbers  will  be  obtained 
(the  probability  of  the  number  zero  occurring  is  zero).  The  equal -tail  sign  test  for  the 
median  can  be  expressed  in  terms  of  the  signs  (♦  or  -)  of  these  numbers.  Let  p  be  the 
number  of  positive  signs  (then  n-p  is  the  number  of  negative  signs).  The  equal-tail  sign 
test  for  coaparing  p  with  the  given  hypothetical  value  p^  is 

Accept  p  /  p^  ^  either  P  >  i  or  p  <  n-i  , 

where  i  >  (n^l)/2«  An  equivalent  way  to  state  thin  test  is  in  terms  of  order  statistics. 
Let  represent  the  values  of  y2^»***»yp  arranged  in  increasing  order  of  magnitude. 

Using  order  statistics,  the  eoual-tail  si^^n  test  for  the  median  is 


(1) 

where  i  >  (n^l)/2, 
the  value 

(2) 


Accept  n  /  Pq  if  either  <  p^  or  >  p^  , 

The  significance  level  of  test  (l)  is  a  function  of  i  and  n  which  has 


Pr(x.  <  p)  ♦  Pr(x 


h-^l-i 


>  p) 


n-1 


n 

2 

s  -i 


nl 

si (n-s) I 


wl  en  conditions  (i)  and  (11)  hold. 

The  statement  of  the  equal-tail  sign  test  in  terms  of  order  statistics  is  convenient 
because  equal-tail  confidence  intervals  for  p  can  also  be  derived.  Since  i  >  (n^l)/2,  it 
follows  from  (1)  and  (2)  that 


(x 


n^l-i» 


is  an  equal-tail  confidence  interval  for  p  with  confidence  coefficient 

nl 


, , , n-1  n 

i-(4)  r 


s  »i 


I  ( n- 


3; 


if  (i)  and  (ii)  hold. 

When  conditions  (i)  and  (ii)  are  not  necessarily  satisfied,  test  (1)  is  no  longer 
exact.  Its  si  ;nificance  level  may  differ  sub  tanti.al'’y  from  the  value  of  (2).  The  null 
hypothesis  may  not  be  expressible  in  the  form  p  -=  p^.  In  many  cases,  however,  the  equal- 
tail  sign  test  furnishes  a  reasonably  close  approximati jh  to  a  fairly  large  class  of 
tests.  This  approximation  is  close  in  the  sense  that  each  test  of  the  class  has  a  sig¬ 
nificance  level  which  is  near  the  value  of  (2)  when  conditions  (i)  and  (ii)  are  even 
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roughly  satisfied.  The  principal  purpose  of  this  paper  is  to  define  this  class  of  tests 
and  investigate  their  significance  level  properties. 

First,  let  us  consider  the  form  and  properties  of  the  null  hypotheses  for  the  class 
of  teats  to  be  investigated.  Since  condition  (i)  is  not  necessarily  satisfied,  the  null 
hypothesis  can  no  longer  be  expressed  in  the  form  p  ■  Let  represent  the  median 
value  (or  set  of  median  values)  for  the  population  from  which  the  observation  y^  was 
drawn  (j  -  l,**»,n).  For  each  test  of  the  class,  the  null  hypothesis  is  required  to  be 
some  funct  .on  of  which  reduces  to  p  -  p^  when  condition  (i)  is  satisfied. 

Since  these  null  hypotheses  represent  generalizations  of  the  null  hypothesis  for  the 
sign  test  (p  -  Pq),  they  will  be  referred  to  as  generalized  null  hypotheses.  Hence  the 
generalized  null  hypotheses  considered  will  be  of  the  fora 


where  the  set  function  h  is  restricted  so  that  it  represents  the  set  of  50?  points  common 
to  ali  po  ulations  (denoted  by  p)  when  condition  (i)  holds.  If  h  is  not  unique,  the 
generalized  null  hypothesis  has  the  interpretation  p^^h. 

The  function  h  is  also  restricted  so  that  it  is  nearly  the  same  as  p  when  condition 
(i)  is  approximately  satisfied.  Stated  in  another  way,  the  function  chosen  for  h  should 
not  be  sensitive  to  oondition  (i);  i.e,,  a  moderate  deviation  from  the  existence  of  a 
common  median  value  should  not  have  an  appreciable  effect  on  h.  For  example,  let 
Pj^,***,P^  be  unique  and  large.  Then  the  function 


would  not  be  suitable  for  use  as  h  even  though  it  reduces  to  p  when  all  the 
value  p. 


have  the 


Now  let  us  define  the  class  of  tests  which  are  investigated  in  this  paper.  All  tests 
of  the  class  reduce  to  the  equal-tail  sign  test  when  conditions  Ti)  and  (ii)  hold. 
Consequently,  each  test  of  the  class  will  be  referred  to  as  a  generalized  test,  A 
generalized  test  is  defined  by 


(3)  Accept  h  /  p^  U;  either  <  p^  or  >  p^  , 

where  i  >  (n>l)/2.  Ttie  significance  level  of  this  test  equals 


(4)  Pr(x^  <  MjiXgeh)  .  Pr{Vl-l  ^  • 

The  value  of  (4)  is  not  completely  determined  by  i  and  n.  It  also  depends  on  many  other 
factors  such  as  the  populations  from  which  the  observations  were  drawn  and  the  value  of 
p^.  In  spite  of  this  inexactness,  the  value  of  (4)  is  usually  rather  closely  fixed  if  h 
is  a  reasonable  type  of  function  and  conditions  (i)  ani  (ii)  are  even  roughly  satisfied. 
Expression  (3)  defines  a  class  of  tests  rather  than  a  single  test  because  of  the  possible 
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choices  for  the  function  h. 

It  should  be  pointed  out  that  test  (3)  does  not  necessarily  have  equal -tails.  That 
is,  the  value  of  Pr(x^  <  is  not  necessarily  equal  to  the  value  of 

^  If'  extreme  cases,  test  (3)  mifht  even  be  one-sided. 

The  main  problem  of  the  paper  is  to  show  that  in  practice  the  value  of  (2)  is 
usually  a  close  approximation  to  the  value  of  (4).  This,  of  course,  is  not  always  true. 
For  example,  consider  the  case  where  some  or  all  of  the  populations  from  which  the 
observations  were  drawn  have  a  large  proportion  of  their  probability  concentrated  at  or 
near  the  median.  Then  the  value  of  (4)  may  differ  greatly  from  that  of  (2)  even  though 
conditions  (i)  and  (ii)  are  very  nearly  satisfied.  For  populations  of  the  type 
ortlinarily  encountered  in  practice  and  a  reasonable  choice  of  h,  however,  the  value  of 
(2)  is  usually  near  that  of  (4)  even  when  conditions  (i)  and  (ii)  are  only  roughly 
satisfied.  This  is  proved  by  obtaining  upper  and  lower  bounds  for  (4)  as  functions  of 
n,  i  and  a  quantity  3.  Here  0  is  defined  to  be  the  greater  of 


max 


Pr(yj  <  PqIPq!  h)  -  ||  , 


max  |Pr(yj  >  h)  -  ||  . 


If  P  -  0,  the  significance  level  of  test  (3)  equals  that  of  test  (1).  If  p  is  small, 
the  value  of  (4)  is  very  ne^r  that  of  (2),  Table  1  contains  upper  and  lower  bounds  for 
the  significance  level  of  test  (3)  for  p  »  ,02,  .05,  .00,  .10,  .15,  .20  and  n  <  15.  If 
the  populations  are  continuous  (or  very  nearly  so)  at  the  value  of  the  lower  bound 
is  noticeably  increased  (see  Table  2).  Thus,  for  n  <  15,  the  value  of  (2)  does  not  differ 
greatly  from  that  of  (4)  even  for  p  moderately  large.  A  value  of  p  as  large  as  .05  would 
seem  unusual  for  the  ordinary  type  of  practical  situation  where  there  is  reason  to  believe 
that  conditions  (i)  and  (ii)  are  satisfied. 

Let  us  consider  the  practical  implications  of  the  fact  that  the  equal-tail  sign  test 
approximates  test  (3)  in  the  sense  of  significance  level.  Suppose  the  experimenter 
recognizes  the  possibility  that  conditions  (i)  and  (ii)  may  not  hold  for  his  experiment. 

then  selects  the  function  h(p^,***,p^)  which  is  of  principal  interest  to  him  and  uses 
test  (3).  In  this  manner  he  obtains  an  accurate  test  of  the  null  hypothesis  in  which  he 
is  interested.  On  the  other  hand,  suppose  that  the  experimenter  iprlies  the  equal-tail 
sign  test  without  considering  the  possibility  that  conditions  (i)  and  (ii)  may  be 
violated.  The  results  of  this  paper  show  that  he  is  protected  if  the  appropriate  function 
h  (which  he  would  have  chosen)  and  the  populations  from  which  the  observ  tions  were  drawn 
are  of  a  reasonable  nature.  Then  he  is  testing  the  appropriate  null  hypothesis  at 
approximately  the  specified  si,‘7iificance  level  even  though  he  may  not  think  of  the  test 
in  this  light. 


Since  for  the  case  of  a  sample  from  a  nomal  population  thie  efficiency  of  the  equal- 
tail  sign  test  decreases  as  n  increases,  much  of  the  investigation  is  limited  to  tests 
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based  on  15  or  fewer  observations.  Table  1  contains  a  list  of  the  tests  investigated 
along  with  their  efficiency  for  noraality.  The  efficiency  of  a  significance  test  (more 
precisely,  the  power  efficiency)  is  defined  in  [l].  Intuitively  the  efficiency  of  a 
test  measures  the  percentage  of  available  information  per  observation  which  is  utilised 
by  that  test. 

The  equal-tail  sign  test  for  the  median  may  be  useful  for  situations  where  there  is 
not  much  information  available  concerning  properties  of  the  populations  from  which  the 
observations  were  taken.  Due  to  the  extremely  general  conditions  under  which  Its 
significance  level  is  approximate,  -r  deterained,  this  test  can  be  used  in  cases  where 
more  specialized  tests  are  not  necessarily  applicable. 

Approximate  confidence  intervals  for  h(pj^, •  •  •  ,p^)  can  be  obtained  from  teat  (3)» 

For  populations  of  the  type  usually  encountered  in  practice  and  a  reasonable  function  h, 


is  a  confidence  interval  for  h  with  confidence  coefficient  approximately  equal  to  unity 
minus  the  value  of  (2). 

The  toraterial  presented  in  this  paper  is  limited  to  investigation  of  Type  I  errors  of 
the  equal-tail  sign  test  when  t^ie  conditions  on  which  it  is  based  are  generalized.  Due 
to  the  extremely  general  situations  considered,  an  investigation  of  Type  II  errors  was 
not  feasible.  However,  the  results  obtained  for  the  particular  case  of  a  sample  from  a 
noitnal  population  indicate  that  the  efficiency  of  the  equal -tail  si/71  test  is  reasonably 
high  for  most  situations  if  the  number  of  observations  is  small, 

3.  Outline  of  Results,  This  section  contains  a  statement  of  the  main  results  of  the 
paper.  The  proofs  of  these  statements  are  given  in  section  4. 

The  method  followed  in  obt;ilning  bounds  for  {U)  consists  in  fixing  n,  i,  p  and  then 
finding  the  largest  and  simallest  values  of  (4)  possible  on  the  basis  of  thiese  and  apy 
additional  restrictions.  Thus  the  bounds  represent  the  vforst  possible  situations  for 
the  given  restrictions.  For  most  situations,  the  v.alue  of  (4)  would  likely  be  nowhere 
near  the  values  of  the  bounds.  Consequently,  for  most  cases  tFie  value  of  (2)  will  be 
much  nearer  (4)  than  is  indicated  by  the  upper  ani  lower  limits  in  the  tables. 

Let  us  consider  the  general  case  where  both  conditions  (i)  and  (ii)  could  be 
violated.  Values  of  uj'per  and  lower  bounds  for  the  significance  level  of  test  (3)  aa 
functions  of  n,  i  and  3  are  given  by 

upper  bound  -  [(i  .  a)\^  -  p)"  '  .  (i  -  s)“(i  .  9)"”] 


lower  bound 


(2  •  ^)  (I  *  0 


n-s 


(5) 
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Thus,  if  0  •  0  the  value  of  (4)  equals  (2)  while  if  0  is  smAll  the  value  of  (4)  is  very 
nearly  equal  to  (2),  Table  1  contains  values  of  these  upper  and  lower  bounds  for  the 
tests  considered,  A  visual  example  of  how  the  bounds  given  by  (5)  vary  as  functions 
of  0  for  fixed  n  and  i  is  given  by  Fig,  1,  which  contains  a  plot  of  these  bounds  for 
the  case  n  -  9,  1  ■  8.  If  0"^^  i  ujper  bound  — >  1  and  the  lower  bound  — >  0, 

A  case  of  practical  interest  is  that  where  condition  (i)  is  not  violated  to  any 
appreciable  extent;  i.e,,  none  of  the  populations  have  a  noticeable  amount  of  probability 
concentrated  at  Then  the  upper  bound  given  in  (5)  still  holds  but  the  lower  bound 

,  is  greatly  improved.  Tabic  2  contains  a  list  of  some  numerical  values  foi  this  lower 
bound.  These  values  aru  only  slightly  less  than  the  value  of  (2)  except  for  large 
values  of  0.  The  dotted  curve  in  Fig.  1  represents  a  plot  of  this  lower  bound  as  a 
function  of  0  for  the  case  n  ■  9,  i  ■  8, 

In  all  the  above  results,  the  n  observations  on  uldch  tests  are  based  were  assumed 
to  be  independent.  Although  no  analysis  will  be  made  for  cases  in  which  the  observations 
are  not  independent,  examination  of  the  significance  level  expression  (4)  for  test  (3) 
indicates  that  the  value  of  (4)  will  often  be  approximately  equal  to  (2)  when  the 
observations  are  mildly  dependent.  This  follows  from  the  intuitive  observation  that  in 
many  cases  dependence  changes  Pr(Xj^  <  ^  ^ 

that  one  probability  expression  is  increased  -/diile  the  other  is  decreased;  consequently 
the  value  of  (4)  tends  to  remain  near  that  of  (2), 

4*  Derivations.  The  purpose  of  this  section  is  to  present  derivations  of  the 
results  stated  in  the  preceding  sections , 

The  expressions  for  (2)  and  (4)  follow  from  i  >  (n>l)/2,  conditions  (i)  and  (ii), 
and  elementary  probability  considerations.  Consider  relations  (5).  Let 


Myj  <  liolMoth)  -  oj.  PHyj  -  ^  • 

Pr(yj  >  (  h)  -  i  •  Yj  ,  (J  - 

Then 
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where  the  notation  TT  denotes  the  product  over  those  values  of  J,  (J  ■  l,»*»,n),  which 
are  different  from  If  i  *  n,  each  double  summation  in  (6)  is  taken  to  be 


zero. 


Examination  of  (6)  shows  that  (4)  can  be  written  in  the  form 

^^“l»‘’*»“n*  ^1'“*' Vl*  “  V^°l»“*»°n»  ^l»***»Vl* 


where 


g(ai,*-*»anJ  ^j4l»"*»^n^  ^  0 


for  each  value  of  J.  Thus,  since  setting  £j  ■  0  places  no  additional  restrictions  on 
the  possible  values  of  the  a's  and  the  other  ^’s,  to  obtain  the  maximum  value  for  (4) 
all  the  €'s  should  be  zero.  Now  consider  (4)  with  all  the  Cs  equal  to  zero.  It  can 
be  written  in  the  form 


(7) 


u( 


for  each  value  of  j.  Since  -P  <  a  <  0  for  all  j,  the  maximum  value  of  (4)  is  obtained 
when  the  Oj  are  restricted  to  be  of  the  form 


(e) 


“J  ■ 


(J  - 


where  each  qj  equals  either  ♦!  or  -1.  Assume  that  an  arbitrary  but  fixed  choice  has  been 
made  for  the  t)^.  Then  (6)  shows  that  (4)  is  a  polynoml.il  in  a  which  is  an  even  function. 
Consider  the  coefficient  of  an  arbitrary  even  power  of  a  in  this  polynomial.  Examination 
shows  that  this  coefficient  is  naximum  (algebraically)  for  the  case  where  .all  the  are 
chosen  to  have  the  s.arae  value.  Hence  (4)  is  maximum  when 


(9)  -  0,  Oj  -  p,  (J  -  !,•••, n). 

Thus  the  upper  bound  for  the  significance  level  of  test  (3)  is  that  given  in  (5). 

Now  consider  the  lower  bound  for  (4).  Examination  of  (6)  shows  that  Pr(x^<4i^(p^  6  h) 
is  minimum  when  Oj  -  -p,  (j  -  l,***,n).  Similarly,  P**(^n+l-l^ol  ^o  ^  minimum  when 

Qj  ♦  ■  p,  (j  ■  !,•••, n).  Thus  (4)  is  minimum  when 

-  ^0,  Qj  -  -p,  (J  -  !,•••, n). 

Substitution  of  these  values  into  (6)  verifies  the  expression  given  in  (5)  for  the  lower 
bound. 

If  the  populations  for  test  (3)  all  satisfy  condition  (ii),  ■  0,  (j  -  l,***,n). 

From  (9),  the  upper  bound  of  (4)  ^iven  in  (5)  is  unchanged  for  this  case.  The  lower 
bound,  however,  can  be  noticeably  larger  than  the  value  stated  in  (5).  Since  for  each 
value  of  j,  (j  -  l,***,n),  the  value  of  (4)  can  be  expressed  in  the  form  (7),  the  lower 
bound  of  the  signific.ance  level  of  test  (3)  is  equal  to  the  minimum  value  which  can  be 
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obtained  for  (4)  when  the  Qj  are  restricted  to  be  of  the  form  (8)  and  the  have  the 
value  zero.  As  (4)  is  invariant  with  respect  to  permutations  of  problem 

of  obtaining  the  lower  bound  of  the  significance  level  of  test  (3)  is  reduced  to  that 
of  determining  the  number  m  of  the  t)j  which  equal  ♦!  when  the  resulting  value  of  (4)  is 
minimuB.  Since  the  lower  bound  for  the  significance  level  of  test  (3)  is  only  required 
for  n  <  15  and  i  >  n-3  (see  Table  2),  an  analytical  method  of  determining  the  value  of 
m  which  minimizes  (4)  will  not  be  developed;  the  values  fbr  the  lower  bounds  listed  in 
Table  2  were  obtained  by  substituting  numerical  values  for  m  and  computing  the  resulting 
values  of  (4).  For  example,  if  i  ■  n  and  m  of  the  -  *0  while  the  remaining  Oj  equal 
-p,  the  value  of  (4)  is 


If  i  <  n,  the  expressions  become  much  more  complicated  and  will  not  be  given  here. 
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